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Abstract: Inhibitors of human DNA methyltransferases (DNMT) are of increasing interest 
to develop novel epi-drugs for the treatment of cancer and other diseases. As the number of 
compounds with reported DNMT inhibition is increasing, molecular docking is shedding 
light to elucidate their mechanism of action and further interpret structure-activity 
relationships. Herein, we present a structure-based rationalization of the activity of 
SWl 55246, a distinct sulfonamide compound recently reported as an inhibitor of human 
DNMTl obtained from high- throughput screening. We used flexible and induce-fit 
docking to develop a binding model of SWl 55246 with a crystallographic structure of 
human DNMTl. Results were in excellent agreement with experimental information 
providing a three-dimensional structural interpretation of 'activity cliffs', e.g., analogues of 
SWl 55246 with a high structural similarity to the sulfonamide compound, but with no 
activity in the enzymatic assay. 
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1. Introduction 

DNA methyltransferases (DNMTs) are a family of enzymes that catalyze the transfer of a methyl 
group from 5'-adenosyl-L-methionine (SAM) to the carbon-5 position of cytosine residues, leading to 
an epigenetic modification [1]. As part of the mechanism of methylation, 5'-adenosyl-L-homocysteine 
(SAH) is generated. Three active forms of DNMT have been identified in mammals: DNMTl, 
DNMT3A/3B, and DNMT3L. DNMTl, which is the most abundant of the three, is involved in the 
maintenance of methylation patterns, whereas DNMT3A and DNMT3B are responsible for de novo 
methylation [2]. DNMT3L is required for the catalytic activity of DNMT3A and DNMT3B, though it 
lacks catalytic activity [3]. These enzymes regulate gene expression. In particular, DNMTl is 
responsible for duplicating patterns of DNA methylation during replication and is essential for 
mammalian development and cancer cell growth [4]. Since inappropriate methylation is thought to be a 
key antecedent step in transformation [5], it is anticipated that DNA hypomethylating drugs that act on 
DNMTs may be effective anti-cancer agents. DNMT inhibitors are also promising new drugs for the 
treatment of brain disorders [6]. There have been rapid synthetic approaches based on the conjugation 
of known inhibitors such as procainamide-RG108 hybrid (Figure 1). Procainamide is a potential 
DNMT inhibitor approved by the FDA as antiarrhythmic, and RG-108 was identified via virtual 
screening (Figure 1). Currently, 5-azacytidine and 5-aza-2'-deoxycytidine are the only two DNMT 
inhibitors clinically in use for the treatment of certain types of cancer [7]. Since there are concerns 
about the low specificity and clinical toxicity of these nucleoside analogues [7] it is convenient to 
identify novel non-nucleoside DNMT inhibitors. Compounds with different chemical classes are 
associated with demethylating activity, and some of them were proposed as DNMT inhibitors (Figure 1). 
Most of these compounds were identified fortuitously. Therefore, there is an increased interest to 
identify novel DNMT inhibitors using systematic computational and experimental screening of 
chemical databases. For example, our group identified NSC 14778 (Figure 1) and other DNMTs with 
distinct chemical scaffolds using virtual screening followed by experimental validation [8]. NSC 
14778 was the starting point to identify olsalazine as a novel hypomethylating agent using a 
computer- guided drug repurposing strategy [9]. The increased availability of crystallographic 
structures of DNMTs have boosted the use of molecular docking and other structure-based 
computational approaches to suggest hypothesis of the binding mode of DNMT inhibitors [10,1 1]. 

Experimental high-throughput screening (HTS) is starting to be used as an approach to identify 
novel inhibitors of DNMTs [12]. A recent HTS used a scintillation proximity assay to evaluate 
-180,000 molecules; the hit confirmation rate was low (0.03%) and most of the hits were found to be 
active due to the generation of reactive oxygen species. Only SW155246 (Figure 2) showed human 
DNMTl activity (IC50 =1.2 |iM) without affecting protein levels or generating reactive oxygen 
species [13]. A focused structure-activity relationship (SAR) analysis showed that the hydroxyl group 
of SW155246 was essential for its activity; loss of the hydroxyl group (SW155246-1) or addition of a 
methylated oxygen on the 1 -position of the naphthyl ring (SW155246-2) (Figure 2) completely 
abolished the ability of this compound to inhibit human DNMTl activity in vitro and reduced the 
cell-based cytotoxicity [13]. This is an example of an 'activity cliff [14,15], i.e., a small change in the 
structure dramatically affects the biological activity. However, the binding mode of SWl 55246 with 
DNMTl and the corresponding rationalization of such activity cliffs have not been reported. 
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Figure 1. Selected compounds associated with DNA methyltransferases (DNMT) 
inhibition and hypomethylating agents. 




Figure 2. Chemical structures of SW155246 and structural analogues studied in this work. 




SW1 55246 SW1 55246-1 S W1 55246-2 



In this work, we elucidate the binding mode of SWl 55246 with human DNMTl providing a 
structure-based interpretation of the observed SAR i.e., loss of activity upon removal or methylation of 
the hydroxyl group. For this purpose, we used molecular docking with a crystallographic structure of 
human DNMTl recently published. In order to account for protein flexibility, we conducted 
induced- fit docking (IFD). We have previously used IFD to model other DNMT inhibitors [16]. The 
binding mode of SW155246 was compared to the observed binding mode of the potent co-crystal 
inhibitor sinefungin. Of note, SWl 55246 is a sulfonamide compound and related sulfonamides have 
shown important bioactivities [17]. We anticipate that the outcome of this work may help to design 
novel sulfonamides as inhibitors of DNMTs. 
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2. Results and Discussion 

Docking of SWl 55246 and their structural analogues with human DNMTl was preceded by 
validation of the docking approach. 

2.1. Validation of the Docking Protocol 

In order to validate the docking protocol, we re-dock the co-crystal inhibitor sinefungin with the 
crystal structure of human DNMTl (PDB ID: 3SWR). The docking protocol, described in the 
Experimental Section, successfully reproduced the bound conformation of sinefungin with a root-mean 
square deviation (RMSD) of 0.547 A. Figure 3 shows a comparison of the predicted binding mode 
with Glide XP with the bound position observed in the crystallographic structure. Results were in 
agreement with previous reports that showed the capability of Glide XP to reproduce the co-crystal 
position of sinefungin [16]. 

Figure 3. Validation of the docking protocol: comparison of the co-crystal (yellow) v^. 
predicted (orange) binding mode of sinefungin with Glide XP. 




2.2. Flexible Docking 

First we did regular docking of the compounds in Figure 2 considering the ligands as flexible but 
retaining the protein as a rigid structure. The rank-order of the Glide XP docking scores of SW155246 
and structural analogues with human DNMTl were consistent with the reported activity. SWl 55246 
showed the lowest, i.e., most favorable Glide XP docking score (-4.817 kcal/mol) as compared to the 
analogues SW155246-1 (-4.312 kcal/mol) and SW155246-2 (-4.370 kcal/mol) (Table 1). However, 
as discussed in the next section, the magnitude of the docking scores and predicted binding modes for 
all three molecules were different from the results obtained with IFD, in particular for the compounds 
with the hydroxyl and methoxy groups. 

Table 1. Docking scores as computed with Glide XP using flexible docking and 
induced-flt docking (IFD). The IFD scores are also reported. 



Compound 


Flexible docking 


IFD 




Glide XP 


Glide XP 


IFDScore 


SW155246 


-4.817 


-5.858 


-1009.528 


SW155246-1 


-4.312 


-5.244 


-1008.860 


SW155246-2 


-4.370 


-5.819 


-1009.967 
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2.3. Induced-Fit Docking 

It is well-known that protein flexibility plays an important role in the protein-ligand recognition 
process. For DNMT inhibitors, we previously showed that IFD may have a significant impact on the 
predicted binding modes and docking scores [16]. To consider protein flexibility we carried out IFD 
with the three compounds in Figure 2. An alternative approach to address protein flexibility is to 
conduct docking with different structures available for human DNMTl. In this work, we focused on 
IFD as this approach provided an excellent agreement with experimental data. Table 1 summarizes the 
Glide XP and IFDScores obtained with the IFD protocol detailed in the Experimental Section. This 
table clearly shows that for all three compounds, the Glide XP scores computed with IFD were more 
favorable than the scores obtained with regular docking where only the ligands are considered flexible 
but the protein is kept fixed during docking. Similar results were obtained for other DNMT inhibitors [16]. 
These results further highlight the importance of considering the flexibility of the protein. Although the 
difference in Glide XP scores between SWl 55246 and the two analogues was small, the active 
compound showed the lowest score in agreement with the experimental data. However, despite the fact 
that significant progress has been made to improve scoring functions, it is well-known that the docking 
scores still have a large room from improvement [18]. Therefore we focused the analysis of the 
binding modes predicted with IFD. 

Figure 4. (a) Induced- fit docking pose of SWl 55246 (carbon atoms in green) with human 
DNMTl. The co-crystal sinefungin (carbon atoms in yellow) is shown for reference; 
(b) Two-dimensional interaction diagram of the binding model of SW155246. Acidic, 
hydrophobic, basic, polar, and other residues at the active site are represented by red, 
green, purple, blue, and gray spheres, respectively. Hydrogen bonds between the ligand 
and backbone or side chains are shown in dashed pink lines. The Ti-cation interaction is 
shown with a red line. 




Figure 4 shows three- and two-dimensional representations of the predicted binding mode obtained 
with IFD for SW155246. The structure of the co-crystal sinefungin that binds in the co-factor binding 
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site is shown in Figure 4a for reference. Figure SI in the Supporting Information shows a comparison 
of the predicted binding mode of the active compound SW155246 and their structural analogues 
SW155246-1 and SW155246-2. A major difference between the predicted binding modes for the three 
sulfonamides is that the active compound SW155246 occupies part of the co-factor binding site as well 
as the catalytic site (Figure SI). The hydroxyl group of SWl 55246 makes a distinct hydrogen bond 
interaction with the side chain of Asnl267 in the ENV motif of the catalytic site [19]. Of note, 
Ansl267 is adjacent to the amino acid Glul266 that participates directly in the mechanism of cytosine 
C5 methylation by stabilizing the substrate [19]. An additional hydrogen bond interaction was 
observed between the nitro group of SW155246 and the backbone of Vail 580. A 7i-cation interaction 
is predicted with Argl312 in the RXR motif. Argl312 participate in the mechanism of cytosine-C5 
methylation [20]. Interactions between Argl312 and several other DNMTl inhibitors, including 
5-azacytidine (Figure 1) have been reported [16,20]. In contrast, the two inactive compounds are 
predicted to occupy only the binding site of the co-factor (Figure SI). 

In addition to the different binding energies obtained with regular and IFD for the three sulfonamide 
compounds (Table 1), the binding poses obtained with regular docking were considerable different 
from the poses obtained with IFD (Figure S2 in the Supporting Information). In particular the orientation 
in the binding pocket of compounds SWl 55246 and the methoxy analogue were significantly 
different. Of note, in the binding orientation obtained with regular docking, the active compound does 
not occupy part of the catalytic site (Figure S2). Residues that had the largest mobility after IFD were 
Argl574, Glnl227 and Argl312 in the docking with SW155246, and also Asnl578 in the docking 
with SW155246-1 and SW155246-2 (data not shown). These results further emphasize the importance 
of IFD to generate binding poses that are consistent with the observed experimental activity. 

SW155246 and their structural analogues are examples of three-dimensional activity cliffs, e.g., 
compounds with very similar structure but different binding mode [21]. For the series of compounds 
studies in this work, while the hydroxyl-containing molecule is active, methylation of loss of the OH 
group abolish activity. The substantial different binding mode predicted with IFD may explain, at least 
in part, the very different biological activity. We have used docking to interpret activity cliffs for other 
targets [22]. 

3. Experimental Section 

The protein structure of human DNMTl bound to sinefiingin (PDB ID: 3SWR) was prepared using 
the Protein Preparation Wizard [23] implemented in Maestro [24] with the following steps: (i) Water 
molecules and all ligand atoms except sinefungin were removed; (ii) Hydrogen atoms were added; 

(iii) Protonation states of entire systems were adjusted to the pH range of 7.0 +/- 3.0 using Epik; 

(iv) Hydrogen bond networks and flip orientations/tautomeric states of Gin, Asn, and His residues 
were optimized; (v) The geometry optimization was performed to a maximum RMSD of 0.1 A with 
the OPLS2005 force field. The chemical structures of SWl 55246 and its analogues were built using 
Molecular Operating Environment (MOE), version 2013.08 [25]. The chemical structure of sinefungin 
was extracted from the crystallographic structure; hydrogen atoms were added and the most favorable 
protonation state was calculated with Protein Preparation Wizard. 
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For flexible docking, the grid box was centered on the coordinates of sinefugin to include the 
cofactor and substrate binding sites (setting the option dock ligands with length of 21 A — the inner box 
had a size of 10 A and the outer box a size of 31 A) Docking was conducted using Glide XP 
(extra precision) [26]. Docked poses with the best (lowest) Glide XP score were selected for analysis. 

For IFD, we employed the IFD protocol implemented in the Schrodinger software suite [27-29]. 
The ligands in Figure 2 were docked into the crystal structure of human DNMTl using the following 
steps: (i) The receptor grid was defined as an enclosing box at the centroid of the top ranked 
conformation of SWl 55246 using regular docking using an automated-generated box size by the IFD 
protocol; (ii) In the initial Glide docking stage, a soften potential docking with van der Waals radii 
scaling of 0.5 for the protein and ligand was performed retaining a maximum number of 20 poses per 
ligand; (iii) Residues within 5.0 A of ligand poses were kept free to move in the Prime refinement step, 
and the side chains were further optimized; (iv) Poses within 30 kcal/mol of the lowest energy 
structure obtained in the previous step were re-docked using Glide XP; (v) The binding energy 
(IFDScore) for each output pose was computed as implemented in the IFD protocol. This score is the 
sum of the GlideScore from the redocking step and 5% of the Prime energy from the refinement 
calculation [27]. The most favorable binding conformations of each ligand complex, as measured by 
the IFDScore, were selected for analysis. 

4. Conclusions 

In this Communication, we report the binding model of SW155246 with human DNMTl. Using 
IFD we provide a structural basis of the large difference observed in activity of this distinct 
sulfonamide inhibitor with closely related structural analogues obtained from HTS. We concluded that 
IFD had a significant impact on the docking scores and the predicted binding modes as compared to 
docking where the structure of the protein is kept fixed. Notably, the active compound SWl 55246 
showed a significantly different binding mode as compared to the structural analogues. We concluded 
that SW155246 occupy part of the co-factor binding site as well as the catalytic site making a distinct 
hydrogen bond interaction with Asnl267. IFD was employed to account for protein flexibility and 
helped to provide a structural explanation of three-dimensional activity cliffs. Other approaches that 
could be used in future work for modeling protein flexibility is to conduct docking with multiple 
conformations of human DNMTl obtained through crystallographic information or molecular 
dynamics. Understanding the SAR, in particular subtle changes that have a large impact on the 
activity, has a major importance to optimize the activity of novel inhibitors [30]. This work also 
illustrates the synergy between computational studies and reported experimental screening to further 
advance the knowledge of protein-ligand interactions relevant to a promising epigenetic target. 
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